Assessing the Readability of Medical Documents: A Ranking Approach
نویسندگان
چکیده
BACKGROUND The use of electronic health record (EHR) systems with patient engagement capabilities, including viewing, downloading, and transmitting health information, has recently grown tremendously. However, using these resources to engage patients in managing their own health remains challenging due to the complex and technical nature of the EHR narratives. OBJECTIVE Our objective was to develop a machine learning-based system to assess readability levels of complex documents such as EHR notes. METHODS We collected difficulty ratings of EHR notes and Wikipedia articles using crowdsourcing from 90 readers. We built a supervised model to assess readability based on relative orders of text difficulty using both surface text features and word embeddings. We evaluated system performance using the Kendall coefficient of concordance against human ratings. RESULTS Our system achieved significantly higher concordance (.734) with human annotators than did a baseline using the Flesch-Kincaid Grade Level, a widely adopted readability formula (.531). The improvement was also consistent across different disease topics. This method's concordance with an individual human user's ratings was also higher than the concordance between different human annotators (.658). CONCLUSIONS We explored methods to automatically assess the readability levels of clinical narratives. Our ranking-based system using simple textual features and easy-to-learn word embeddings outperformed a widely used readability formula. Our ranking-based method can predict relative difficulties of medical documents. It is not constrained to a predefined set of readability levels, a common design in many machine learning-based systems. Furthermore, the feature set does not rely on complex processing of the documents. One potential application of our readability ranking is personalization, allowing patients to better accommodate their own background knowledge.
منابع مشابه
Assessing the Readability of Patient Education Materials about Diabetes Available in Shiraz Health Centers
Introduction: Patient education materials are one of the important factors to improve the health literacy of patients with chronic diseases like diabetes and are employed in order to develop self-care skills. These materials will meet such objectives if they are understandable by their audiences. Hence, the aim of present study was to evaluate the readability of educational resources published ...
متن کاملContent-Based Readability Assessment: A Study Using A Syllabic Alphabetic Language (Thai)
Text readability is typically defined in terms of “grade level”; the expected educational level of the reader at which the text is directed. Mechanisms for measuring readability in English documents are well established; however this is not in case in many other languages, such as syllabic alphabetic languages. In this paper seven different mechanisms for assessing the readability of syllabic a...
متن کاملAssessing Readability of Patient Education Pamphlets in Training Hospitals in the City of Mashhad
Background: Patient education is taken into account as one of the key components of comprehensive care as well as one of the significant nursing functions in order to increase community health. In this respect, education materials and written texts can improve patient information up to 50% and consequently meet patient satisfaction. Readability is considered as an integral concept in patient ed...
متن کاملTask 2a: Team KU-CS: Query Coherence Analysis for PRF and Genomics Expansion
Laypeople who are not medical expert may formulate short query using words from their discharge summaries or long query that explain their health conditions. The different query styles should be treated with different query expansion mechanisms. This work is an adaptive query expansion based on the coherence among query terms. To provide users with more readability documents, the document compl...
متن کاملAssessing the relative reading level of sentence pairs for text simplification
While the automatic analysis of the readability of texts has a long history, the use of readability assessment for text simplification has received only little attention so far. In this paper, we explore readability models for identifying differences in the reading levels of simplified and unsimplified versions of sentences. Our experiments show that a relative ranking is preferable to an absol...
متن کامل